With the advent of the next-generation GPS III constellation and the upcoming Navigation Technology Satellite-3 (NTS-3) testbed for exploring future GPS technologies, we have truly entered a new era of satellite navigation. Accordingly, it is time to revisit the design methods for GPS spreading code families. In this work, we develop a natural evolution strategy (NES) machine learning algorithm with a Gaussian proposal distribution that constructs high-quality families of spreading code sequences. We minimize the maximum between the average balanced auto-correlation and the average cross-correlation, and demonstrate that our algorithm achieves better performance than well-chosen families of equal-length Gold codes and Weil codes, for sequence lengths of 1023 and 1031 bits and family sizes of up to 31 codes. Furthermore, we compare our algorithm with an analogous genetic algorithm implementation given the same code evaluation metric. To the best of the authors' knowledge, this is the first work to use machine learning methods to design navigation spreading code sequences.
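A minimal sketch of the idea, not the authors' implementation: a natural evolution strategy with a Gaussian search distribution over real-valued code logits, where each sample is binarized and scored by an illustrative correlation cost (the maximum of the mean-square auto- and cross-correlation over circular shifts). The sequence length, family size, and all hyperparameters below are placeholder values.

```python
import numpy as np

def correlation_cost(codes):
    """Illustrative cost: max of mean-square auto-correlation (non-zero shifts)
    and mean-square cross-correlation for a family of +/-1 sequences."""
    n_codes, _ = codes.shape
    spectra = np.fft.fft(codes, axis=1)
    # Circular correlations of every code pair via the FFT.
    corr = np.fft.ifft(spectra[:, None, :] * np.conj(spectra[None, :, :]), axis=2).real
    auto = corr[np.arange(n_codes), np.arange(n_codes), 1:]   # exclude zero shift
    cross = corr[~np.eye(n_codes, dtype=bool)]                # all shifts, all pairs
    return max(np.mean(auto ** 2), np.mean(cross ** 2))

def nes_design(n_codes=3, length=127, pop=50, iters=200, lr=0.05, seed=0):
    """NES with a Gaussian proposal over real-valued logits; binary codes are
    obtained by taking the sign of the sampled logits."""
    rng = np.random.default_rng(seed)
    mean = rng.normal(size=(n_codes, length))
    for _ in range(iters):
        eps = rng.normal(size=(pop, n_codes, length))
        costs = np.array([correlation_cost(np.sign(mean + e)) for e in eps])
        advantages = (costs - costs.mean()) / (costs.std() + 1e-8)
        # Step the search distribution's mean against the estimated cost gradient.
        mean -= lr * np.tensordot(advantages, eps, axes=1) / pop
    return np.sign(mean)

family = nes_design()
print("final cost:", correlation_cost(family))
```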
Recently, there has been growing interest in using small satellite platforms for a future Lunar Navigation Satellite System (LNSS) to allow cost-effective and rapid deployment. However, many design choices have yet to be finalized for a smallsat-based LNSS, including the onboard clock and the orbit type. Compared with legacy Earth GPS, designing an LNSS poses unique challenges: (a) the size, weight, and power (SWaP) constraints of the onboard clock limit its timing stability; (b) the limited lunar ground monitoring stations create a greater preference for stable LNSS satellite orbits. In this work, we analyze the trade-offs among the different design considerations related to the onboard clock and the lunar orbit type for designing an LNSS with time transfer from Earth GPS. Our proposed time-transfer architecture combines intermittently available Earth-GPS signals in a timing filter to alleviate the cost and SWaP requirements of the onboard clock. Specifically, we conduct multiple case studies with different grades of low-SWaP clocks and previously studied lunar orbit types. We estimate the lunar User Equivalent Range Error (UERE) metric to characterize the ranging accuracy of signals transmitted from an LNSS satellite. Using a simulation setup based on the Systems Tool Kit (STK) from Analytical Graphics, Inc. (AGI), we evaluate the lunar UERE for various case studies of LNSS designs to demonstrate performance comparable to legacy Earth GPS, even when using a low-SWaP onboard clock. We further perform a sensitivity analysis to investigate the variation in the lunar UERE metric across the different case studies as the Earth-GPS measurement update rate varies.
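As a rough illustration of how a user equivalent range error (UERE) budget can be characterized, the sketch below combines assumed error components by root-sum-square, with the onboard-clock term growing between Earth-GPS timing updates. The component values, the clock-error approximation, and the function name `lunar_uere` are all illustrative assumptions, not the paper's model.

```python
import numpy as np

def lunar_uere(clock_adev_at_tau, update_interval_s,
               ephemeris_err_m=1.0, receiver_noise_m=0.3, multipath_m=0.2,
               c=299_792_458.0):
    """Illustrative root-sum-square UERE budget in metres. The clock term
    approximates the timing error accumulated between Earth-GPS updates as
    c * ADEV(tau) * tau; all component magnitudes are placeholder values."""
    clock_err_m = c * clock_adev_at_tau * update_interval_s
    components = np.array([clock_err_m, ephemeris_err_m,
                           receiver_noise_m, multipath_m])
    return float(np.sqrt(np.sum(components ** 2)))

# Example: compare two assumed clock stabilities for updates every 60 s.
for adev in (3e-11, 1e-12):
    print(f"ADEV {adev:g}: UERE {lunar_uere(adev, 60.0):.2f} m")
```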
Many real-world applications of language models (LMs), such as code autocomplete and writing assistance, involve human-LM interaction, but the main LM benchmarks are non-interactive, where a system produces output without human intervention. To evaluate human-LM interaction, we develop a framework, Human-AI Language-based Interaction Evaluation (H-LINE), that expands non-interactive evaluation along three dimensions, capturing (i) the interactive process, not only the final output; (ii) the first-person subjective experience, not just a third-party assessment; and (iii) notions of preference beyond quality. We then design five tasks ranging from goal-oriented to open-ended to capture different forms of interaction. On four state-of-the-art LMs (three variants of OpenAI's GPT-3 and AI21's J1-Jumbo), we find that non-interactive performance does not always result in better human-LM interaction and that first-person and third-party metrics can diverge, suggesting the importance of examining the nuances of human-LM interaction.
In the past few decades, there has been an explosion in the amount of data produced by various sources on different topics. The availability of this enormous amount of data calls for effective computational tools to explore it, which has led to growing interest in the research community in developing computational methods for processing such text data. One line of study focuses on condensing text so that a higher-level understanding can be obtained in a shorter time. The two main tasks for this are keyword extraction and text summarization. In keyword extraction, we are interested in finding the key words of a text, which familiarizes us with its general topic. In text summarization, we are interested in producing a short text that includes the important information in the document. The TextRank algorithm, an unsupervised learning method that extends PageRank (the algorithm at the core of the Google search engine for ranking pages), has shown its efficacy in large-scale text mining, especially for text summarization and keyword extraction. This algorithm can automatically extract the important parts of a text (keywords or sentences) and return them as the result. However, it neglects the semantic similarity between the different parts. In this work, we improve the results of the TextRank algorithm by incorporating the semantic similarity between parts of the text. Aside from keyword extraction and text summarization, we develop a topic clustering algorithm based on our framework, which can be used on its own or as part of generating the summary to overcome coverage problems.
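A minimal sketch of the approach described above: sentences form the nodes of a graph, pairwise semantic similarity supplies the edge weights, and a PageRank-style power iteration ranks the sentences. TF-IDF cosine similarity stands in for the semantic-similarity model here; a sentence-embedding model could be substituted without changing the rest.

```python
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

def textrank_summary(sentences, top_k=2, damping=0.85, iters=100, tol=1e-6):
    """Rank sentences with PageRank over a similarity-weighted graph and
    return the top_k sentences in their original order."""
    tfidf = TfidfVectorizer().fit_transform(sentences)
    sim = cosine_similarity(tfidf)
    np.fill_diagonal(sim, 0.0)                      # no self-loops
    # Row-normalise to get transition weights; isolated rows fall back to uniform.
    row_sums = sim.sum(axis=1, keepdims=True)
    trans = np.divide(sim, row_sums,
                      out=np.full_like(sim, 1.0 / len(sentences)),
                      where=row_sums > 0)
    scores = np.full(len(sentences), 1.0 / len(sentences))
    for _ in range(iters):                          # PageRank power iteration
        new = (1 - damping) / len(sentences) + damping * trans.T @ scores
        if np.abs(new - scores).sum() < tol:
            break
        scores = new
    ranked = np.argsort(scores)[::-1][:top_k]
    return [sentences[i] for i in sorted(ranked)]

doc = ["Graph-based ranking scores each sentence by its connections.",
       "Semantic similarity between sentences defines the edge weights.",
       "Unrelated filler text contributes little to the summary."]
print(textrank_summary(doc, top_k=1))
```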
The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as the bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical image analysis, we designed an international survey that was issued to all participants of challenges conducted in conjunction with the IEEE ISBI 2021 and MICCAI 2021 conferences (80 competitions in total). The survey covered participants' expertise and working environments, their chosen strategies, as well as algorithm characteristics. A median of 72% of challenge participants took part in the survey. According to our results, knowledge exchange was the primary incentive (70%) for participation, while the reception of prize money played only a minor role (16%). While a median of 80 working hours was spent on method development, a large portion of participants stated that they did not have enough time for method development (32%). 25% perceived the infrastructure to be a bottleneck. Overall, 94% of all solutions were deep learning-based. Of these, 84% were based on standard architectures. 43% of the respondents reported that the data samples (e.g., images) were too large to be processed at once. This was most commonly addressed by patch-based training (69%), downsampling (37%), and solving 3D analysis tasks as a series of 2D tasks. K-fold cross-validation on the training set was performed by only 37% of the participants, and only 50% of the participants performed ensembling, based either on multiple identical models (61%) or on heterogeneous models (39%). 48% of the respondents applied postprocessing steps.
We explore unifying a neural segmenter with two-pass cascaded-encoder ASR into a single model. A key challenge is allowing the segmenter (which runs in real time, synchronously with the decoder) to finalize the 2nd pass (which runs 900 ms behind real time) without introducing user-perceived latency or deletion errors during inference. We propose a design where the neural segmenter is integrated with the causal 1st-pass decoder to emit an end-of-segment (EOS) signal in real time. The EOS signal is then used to finalize the non-causal 2nd pass. We experiment with different ways of finalizing the 2nd pass and find that a novel dummy-frame injection strategy allows for simultaneously high-quality 2nd-pass results and low finalization latency. On a real-world long-form captioning task (YouTube), we achieve 2.4% relative WER and 140 ms EOS latency gains over a baseline VAD-based segmenter with the same cascaded encoder.
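A toy sketch of the dummy-frame idea, assuming a fixed right-context requirement for the non-causal 2nd pass (the class, the frame format, and the `right_context` value are illustrative, not the paper's system): when the 1st pass emits EOS, zero-valued dummy frames replace the missing future audio so the segment can be finalized immediately instead of waiting for real frames to arrive.

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class TwoPassFinalizer:
    """Toy control flow for EOS-triggered finalization with dummy frames."""
    right_context: int = 90            # frames of future context the 2nd pass needs
    buffer: List[list] = field(default_factory=list)

    def push_frame(self, frame, eos_from_first_pass: bool):
        self.buffer.append(frame)
        if eos_from_first_pass:
            # Inject zero-valued dummy frames in place of real future audio.
            dummy = [[0.0] * len(frame)] * self.right_context
            segment = self.buffer + dummy
            self.buffer = []
            return self.second_pass(segment)   # finalized hypothesis
        return None                            # segment still open

    def second_pass(self, frames):
        # Placeholder for the non-causal cascaded-encoder decode.
        return f"decoded segment of {len(frames)} frames"

fin = TwoPassFinalizer(right_context=3)
for t in range(5):
    result = fin.push_frame([0.1, 0.2], eos_from_first_pass=(t == 4))
print(result)   # "decoded segment of 8 frames"
```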
Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access language model designed and built thanks to a collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total). We find that BLOOM achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning. To facilitate future research and applications using LLMs, we publicly release our models and code under the Responsible AI License.
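For readers who want to try the released checkpoints, a minimal prompting example with the Hugging Face transformers library is sketched below; the smaller `bigscience/bloom-560m` checkpoint is assumed here so the example can run on a single machine, whereas the full 176B model requires a multi-GPU setup.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumed smaller checkpoint from the BLOOM family for a quick local test.
model_name = "bigscience/bloom-560m"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

inputs = tokenizer("Translate to French: The weather is nice today.",
                   return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=30, do_sample=False)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```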
Multi-object tracking is a cornerstone capability of any robotic system. Most approaches follow a tracking-by-detection paradigm. However, within this framework, detectors function in a low precision-high recall regime, ensuring a low number of false-negatives while producing a high rate of false-positives. This can negatively affect the tracking component by making data association and track lifecycle management more challenging. Additionally, false-negative detections due to difficult scenarios like occlusions can negatively affect tracking performance. Thus, we propose a method that learns shape and spatio-temporal affinities between consecutive frames to better distinguish between true-positive and false-positive detections and tracks, while compensating for false-negative detections. Our method provides a probabilistic matching of detections that leads to robust data association and track lifecycle management. We quantitatively evaluate our method through ablative experiments and on the nuScenes tracking benchmark where we achieve state-of-the-art results. Our method not only estimates accurate, high-quality tracks but also decreases the overall number of false-positive and false-negative tracks. Please see our project website for source code and demo videos: sites.google.com/view/shasta-3d-mot/home.
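A generic sketch of affinity-based association, not the paper's learned SHASTA model: a shape term (feature cosine similarity) and a spatio-temporal term (centre distance) are blended into an affinity matrix, matched with the Hungarian algorithm, and gated so that low-affinity detections stay unmatched; all names, weights, and thresholds are illustrative.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment

def associate(track_feats, det_feats, track_centers, det_centers,
              gate=0.3, alpha=0.5):
    """Match tracks to detections by blending shape and spatio-temporal
    affinities; returns matched (track, detection) pairs and unmatched
    detection indices (which may spawn new tracks)."""
    t_norm = np.linalg.norm(track_feats, axis=1, keepdims=True)
    d_norm = np.linalg.norm(det_feats, axis=1, keepdims=True)
    shape_aff = (track_feats @ det_feats.T) / (t_norm @ d_norm.T + 1e-8)
    dist = np.linalg.norm(track_centers[:, None, :] - det_centers[None, :, :], axis=2)
    spatio_aff = np.exp(-dist)                     # closer centres -> higher affinity
    affinity = alpha * shape_aff + (1 - alpha) * spatio_aff
    rows, cols = linear_sum_assignment(-affinity)  # maximise total affinity
    matches = [(r, c) for r, c in zip(rows, cols) if affinity[r, c] >= gate]
    unmatched = sorted(set(range(len(det_centers))) - {c for _, c in matches})
    return matches, unmatched

rng = np.random.default_rng(0)
tracks_f, dets_f = rng.random((2, 8)), rng.random((3, 8))
tracks_c, dets_c = rng.random((2, 3)), rng.random((3, 3))
print(associate(tracks_f, dets_f, tracks_c, dets_c))
```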
Self-supervised pre-training of a speech foundation model, followed by supervised fine-tuning, has shown impressive quality improvements on automatic speech recognition (ASR) tasks. Fine-tuning a separate foundation model for each of many downstream tasks is expensive, since the foundation model is usually very large. Parameter-efficient fine-tuning methods (e.g., adapters and sparse update methods) offer an alternative paradigm in which a small set of parameters is updated to adapt the foundation model to new tasks. However, these methods still suffer from high computational memory cost and slow training speed because they require backpropagation through the entire neural network at each step. In this paper, we analyze the performance of features at different layers of a foundation model on the speech recognition task and propose a novel hierarchical feature fusion method for resource-efficient transfer learning from speech foundation models. Experimental results show that the proposed method can achieve better performance on the speech recognition task than existing algorithms, with fewer trainable parameters, less computational memory cost, and faster training speed. After combining with adapters at all layers, the proposed method can achieve the same performance as fine-tuning the whole model with $97\%$ fewer trainable encoder parameters and $53\%$ faster training speed.
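A minimal sketch of layer-wise feature fusion over a frozen encoder, under the assumption of ELMo-style learnable softmax weights (the paper's exact fusion architecture may differ): only the fusion weights and a small head are trained, so no backpropagation through the foundation model is needed.

```python
import torch
import torch.nn as nn

class LayerFusionHead(nn.Module):
    """Combine hidden states from every encoder layer with learnable softmax
    weights and map the fused features to per-frame output logits."""
    def __init__(self, num_layers: int, dim: int, vocab_size: int):
        super().__init__()
        self.layer_logits = nn.Parameter(torch.zeros(num_layers))
        self.head = nn.Linear(dim, vocab_size)

    def forward(self, layer_states):                 # list of (B, T, dim) tensors
        stacked = torch.stack(layer_states, dim=0)   # (L, B, T, dim)
        weights = torch.softmax(self.layer_logits, dim=0)
        fused = (weights[:, None, None, None] * stacked).sum(dim=0)
        return self.head(fused)                      # (B, T, vocab_size)

# Frozen encoder features are assumed precomputed (random tensors stand in here),
# so the large foundation model itself never receives gradients.
layers = [torch.randn(2, 50, 256) for _ in range(12)]
logits = LayerFusionHead(num_layers=12, dim=256, vocab_size=32)(layers)
print(logits.shape)   # torch.Size([2, 50, 32])
```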
DeepMind's Game Theory and Multi-Agent team researches several aspects of multi-agent learning, ranging from computing approximations to fundamental concepts in game theory, to simulating social dilemmas in rich spatial environments, to training 3-D humanoids in difficult team coordination tasks. A signature aim of our group is to use the resources and expertise made available to us at DeepMind in deep reinforcement learning to explore multi-agent systems in complex environments, and to use these benchmarks to advance our understanding. Here, we summarize our team's recent work and present a taxonomy that we believe highlights many important open challenges in multi-agent research.